General-Purpose Text Categorization Applied to the Medical Domain
نویسندگان
چکیده
This paper presents work where a general-purpose text categorization method was applied to categorize medical free-texts. The purpose of the experiments was to examine how such a method performs without any domain-specific knowledge, hand-crafting or tuning. Additionally, we compare the results from the general-purpose method with results from runs in which a medical thesaurus as well as automatically extracted keywords were used when building the classifiers. We show that standard text categorization techniques using stemmed unigrams as the basis for learning can be applied directly to categorize medical reports, yielding an F-measure of 83.9, and outperforming the more sophisticated methods.
منابع مشابه
A Comparison of Text Categorization Methods
In this paper firstly I have compared Single Label Text Categorization with Multi Label Text Categorization in detail then I have compared Document Pivoted Categorization with Category Pivoted Categorization in detail. For this purpose I have given the general definition of Text Categorization with its mathematical notation for the purpose of its frugality and cost effectiveness. Then with the ...
متن کاملA Sociological Definition and Categorization of Information Ethics
Background and Aim: This paper aims at the analysis of the definitions and categorizations of the realm of “Information Ethics” to criticize assumptions and clarify points of departure for introducing a new definition and categorization. Method: I used documentary research method and conceptual analysis approach. This method and approach is the best fits with the goal of pursuit roots of social...
متن کاملGetting to the (c)ore of knowledge: mining biomedical literature
Literature mining is the process of extracting and combining facts from scientific publications. In recent years, many computer programs have been designed to extract various molecular biology findings from Medline abstracts or full-text articles. The present article describes the range of text mining techniques that have been applied to scientific documents. It divides 'automated reading' into...
متن کاملEditorial Volume 5, Issue1
Applied Literature, however, does not have literature at its centre. Literature in this domain is a tool to solve problems and achieve goals. Using literature to teach and learn languages, the application of literature to language education, is a very handy example. Health Humanities (by Crawford, et al. and reviewed by A. Ramazani in our Journal's previous issue) comprises chapters on how lite...
متن کاملText categorization using automatically acquired domain ontology
In this paper, we describe ontology-based text categorization in which the domain ontologies are automatically acquired through morphological rules and statistical methods. The ontology-based approach is a promising way for general information retrieval applications such as knowledge management or knowledge discovery. As a way to evaluate the quality of domain ontologies, we test our method thr...
متن کامل